A General Theory of Goodness of Fit in Likelihood Fits

نویسنده

  • Rajendran Raja
چکیده

Maximum likelihood fits to data can be performed using binned data and unbinned data. The likelihood fits in either case produce only the fitted quantities but not the goodness of fit. With binned data, one can obtain a measure of the goodness of fit by using the χ2 method, after the maximum likelihood fitting is performed. With unbinned data, currently, the fitted parameters are obtained but no measure of goodness of fit is available. This remains, to date, an unsolved problem in statistics. By considering the transformation properties of likelihood functions with respect to change of variable, we conclude that the likelihood ratio of the theoretically predicted probability density to that of the data density is invariant under change of variable and provides the goodness of fit. We show how to apply this likelihood ratio for binned as well as unbinned likelihoods and show that even the χ2 test is a special case of this general theory. In order to calculate errors in the fitted quantities, we need to solve the problem of inverse probabilities. We use Bayes’ theorem to do this, using the data density obtained in the goodness of fit. This permits one to invert the probabilities without the use of a Bayesian prior. The resulting statistics is consistent with frequentist ideas. Preprint submitted to Elsevier Science 2 February 2008

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Measure of the Goodness of Fit in Unbinned Likelihood Fits; End of Bayesianism?

Maximum likelihood fits to data can be done using binned data (histograms) and unbinned data. With binned data, one gets not only the fitted parameters but also a measure of the goodness of fit. With unbinned data, currently, the fitted parameters are obtained but no measure of goodness of fit is available. This remains, to date, an unsolved problem in statistics. Using Bayes’ theorem and likel...

متن کامل

A Measure of the Goodness of Fit in Unbinned Likelihood Fits

Abstract Maximum likelihood fits to data can be done using binned data (histograms) and unbinned data. With binned data, one gets not only the fitted parameters but also a measure of the goodness of fit. With unbinned data, currently, the fitted parameters are obtained but no measure of goodness of fit is available. This remains, to date, an unsolved problem in statistics. Using Bayes theorem a...

متن کامل

Finite Sample Corrections for Parameters Estimation and Significance Testing

An increasingly important problem in the era of Big Data is fitting data to distributions. However, many stop at visually inspecting the fits or use the coefficient of determination as a measure of the goodness of fit. In general, goodness-of-fit measures do not allow us to tell which of several distributions fit the data best. Also, the likelihood of drawing the data from a distribution can be...

متن کامل

Generalization of Chisquare Goodness-of-Fit Test for Binned Data Using Saturated Models, with Application to Histograms

This note is a quick review of a generalization of the chisquare goodnessof-fit test for the situation when the data are not Gaussian (as for example histogram bin contents). The generalization, already in use for many years, is based on the likelihood ratio test in which one uses in the denominator a saturated model, i.e., a model that fits the data exactly. As with the Gaussian test (and in f...

متن کامل

Comparison of experimental data to Monte Carlo simulation—Parameter estimation and goodness-of-fit testing with weighted events

Relations are derived that can be used to infer parameters in situations where theoretical predictions can be compared to experimental data only indirectly via Monte Carlo simulation. We consider least square and likelihood ratio fits. Parameter changes in the fitting procedures are performed by reweighting Monte Carlo events. Formulas for goodness-of-fit tests based on the w2 and the likelihoo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005